Transparent, reproducible data.
نویسنده
چکیده
T he scientific research paper remains the main mode of sharing scientific information. Two key aspects contribute to the unabated success of this format, despite the myriad possibilities for new ways to share research in an online world: Independent validation by peer review and the juxtaposition of research results—presented in figures, tables, and videos—with the authors’ textual interpretation of these results. For many years, publishers have focused largely on rendering text for publication and search engines are limited to indexing the title, abstract, introduction, results, methods and discussion sections of a paper. Yet, the claims made in a paper rely on data selected by the authors and synthesized into figures that illustrate the findings for the human reader. Figures are the heart of the paper and should open a vista on the research data. Unfortunately, that view is currently often limited to data visualizations and illustrations. As we have reported previously (Lemberger, 2010; Pulverer, 2014a,b), we encourage the presentation of Source Data, and indeed links to Source Data are already available in many figures in the journal. Source Data provides access to minimally processed versions of the data underlying figures such as data files used to plot graphs and experimental replicates or less cropped versions of representative data shown in a figure panel. Source Data can also include higher resolution renderings of data compressed in figures for efficient downloading. Source Data provides a foundation to a paper that affords a much richer view of the findings discussed and enables reanalysis and reuse by the interested reader, as well as computational access. (The data are published under a CC-0 license.) It also provides an efficient means to archive data in close juxtaposition with the paper that discusses the findings. A central tenet of publishing is to ensure reproducibility. This requires the transparent and detailed reporting and sharing not only of the data, but also the underlying methods and reagents. Our goal is to develop method sections that satisfy these conditions, including detailed experimental protocols and reagents that can be identified unambiguously, and which link directly to the relevant figure panels and data. We are conscious that these enhancements will need to tie into the development of digital laboratory management tools that allow submission of this information without unreasonably burdening the authors.
منابع مشابه
New Approach in Interferometric Length Measurements
Interferometric length measurements without parallax error, based on the phenomenon of the reproducible wringing, are demonstrated. Using reproducible wringing together with the slave-block technique, accuracy level below 1 nm can be obtained for blocks of a few mm length with small flatness deviation. Phase change correction determination with sub-nm uncertainty is achieved for blocks up to 10...
متن کاملTowards a more reproducible ecology
of research papers. But this development also poses some important challenges. The large amount of project-specific software being generated for analytical studies means that analytical standards are harder to establish, potentially limiting the reproducibility of much of recently published science. Also, analytical and coding errors may escape detection, with potentially highly problematic res...
متن کاملResults May Vary: Reproducibility, Open Science and All That Jazz
How could we evaluate research and researchers? Reproducibility underpins the scientific method: at least in principle if not practice. The willing exchange of results and the transparent conduct of research can only be expected up to a point in a competitive environment. Contributions to science are acknowledged, but not if the credit is for data curation or software. From a bioinformatics vie...
متن کاملKinFin: Software for Taxon-Aware Analysis of Clustered Protein Sequences
The field of comparative genomics is concerned with the study of similarities and differences between the information encoded in the genomes of organisms. A common approach is to define gene families by clustering protein sequences based on sequence similarity, and analyze protein cluster presence and absence in different species groups as a guide to biology. Due to the high dimensionality of t...
متن کاملThe Relationship Between Non-Transparent Financial Reporting and Risk Stock Futures Fall Due to the Size and Performance
The purpose of this study was to investigate the relationship between stock futures fall risk with non-transparent financial reporting at three levels of size, efficiency and return on equity, in the period 2010 to 2014 was in Tehran Stock Exchange. The population of the study are all companies listed in Tehran Stock Exchange. Data collected and calculated by using Excel software Eviews 7 been ...
متن کاملEstablishing a framework for Open Geographic Information science
When conducting research within a framework of Geographic Information Science (GISc), the scientific validity of this work can be argued as highly dependent upon the extent to which the methods employed are reproducible, and that, in the strictest sense, can only be fully achieved by implementing transparent workflows that utilize both open source software and openly available data. After consi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- The EMBO journal
دوره 33 22 شماره
صفحات -
تاریخ انتشار 2014